A novel keyframe extraction method for video classification using deep neural networks

نویسندگان

چکیده

Abstract Combining convolutional neural networks (CNNs) and recurrent (RNNs) produces a powerful architecture for video classification problems as spatial–temporal information can be processed simultaneously effectively. Using transfer learning, this paper presents comparative study to investigate how temporal utilized improve the performance of when CNNs RNNs are combined in various architectures. To enhance identified effective combination CNN RNN, novel action template-based keyframe extraction method is proposed by identifying informative region each frame selecting keyframes based on similarity between those regions. Extensive experiments KTH UCF-101 datasets with ConvLSTM-based classifiers have been conducted. Experimental results evaluated using one-way analysis variance, which reveals effectiveness sense that it significantly accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification-Based Singing Melody Extraction Using Deep Convolutional Neural Networks

Singing melody extraction is the task that identifies the melody pitch contour of singing 1 voice from polyphonic music. Most of the traditional melody extraction algorithms are based on 2 calculating salient pitch candidates or separating the melody source from the mixture. Recently, 3 classification-based approach based on deep learning has drawn much attentions. In this paper, 4 we present a...

متن کامل

A Novel Tool for Quick Video Summarization using Keyframe Extraction Techniques

The increasing availability of short, unstructured video clips on the Web has generated an unprecedented need to organize, index, annotate and retrieve video contents to make them useful to potential viewers. This paper presents a novel, simple, and easy-to-use tool to benchmark different low level features for video summarization based on keyframe extraction. Moreover, it shows the usefulness ...

متن کامل

Novel Keyframe Extraction for Video Content Summarization using LBG Codebook Generation Technique of Vector Quantization

In the current era, most of the digital information in the form of multimedia with a giant share of videos. Videos do have audio and visual content where the visual content has number of frames put in a sequence. Most of the consecutive frames do have very little discriminative contents. In video summarization process, several frames containing similar information are needed to get processed. T...

متن کامل

Surgical video retrieval using Deep Neural Networks

Although the amount of raw surgical videos, namely videos captured during surgical interventions, is growing fast, automatic retrieval and search remains a challenge. This is mainly due to the nature of the content, i.e. visually non-consistent tissue, diversity of internal organs, abrupt viewpoint changes and illumination variation. We propose a framework for retrieving surgical videos and a p...

متن کامل

Audio event classification using deep neural networks

We present in this paper our work on audio event classification for outdoor events. As the main classification method we employ a deep neural network (DNN) and compare this to other classification methods. We propose a novel improvement to the pre-training process of the network which is useful when training with Gaussian data. Our experimental results are based on an audio corpus extracted fro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neural Computing and Applications

سال: 2021

ISSN: ['0941-0643', '1433-3058']

DOI: https://doi.org/10.1007/s00521-021-06322-x